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Abstract 

In this paper we present an implicit dynamic dictionary with the working-set property, support- 
ing insert(e) and delete(e) in C(logn) time, predecessor (e) in C)(log£p(g)) time, successor(e) in 
0(log4(e)) time and search(e) in (log min(^p(e),i?e, 4(e))) time, where n is the number of ele- 
ments stored in the dictionary, £e is the number of distinct elements searched for since element e 
was last searched for and p(e) and s(e) are the predecessor and successor of e, respectively. The 
time-bounds are all worst-case. The dictionary stores the elements in an array of size n using no 
additional space. In the cache-oblivious model the log is base B and the cache-obliviousness is 
due to our black box use of an existing cache-oblivious implicit dictionary. This is the first impli- 
cit dictionary supporting predecessor and successor searches in the working-set bound. Previous 
implicit structures required C(logn) time. 
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Keywords and phrases working-set property, dictionary, implicit, cache-oblivious, worst-case, 
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\\ Introduction 

In this paper we consider the problem of maintaining a cache-oblivious implicit dictionary |13j 
with the working-set property over a dynamically changing set P of |P| = n distinct and 
totally ordered elements. We define the working-set number of an element e € P to be 
= |{e' S P I we have searched for e' after we last searched for e}|. An implicit dictionary 
maintains n distinct keys without using any other space than that of the n keys, i.e. the data 
structure is encoded by permuting the n elements. The fundamental trick in the implicit 
model, [12], is to encode a bit using two distinct elements x and y: if min(x,?/) is before 
max(a;, y) then x and y encode a bit, else they encode a 1 bit. This can then be used to 
encode / bits using 21 elements. The implicit model is a restricted version of the unit cost 
RAM model with a word size of O(logn). The restrictions are that between operations we 
are only allowed to use an array of the n input elements to store our data structures by 
permuting the input elements, i.e., there can be used no additional space between operations. 
In operations we are allowed to use 0(1) extra words. Furthermore we assume that the 
number of elements n in the dictionary is externally maintained. Our structure will support 
the following operations: 

mt Search(e) determines if e is in the dictionary, if so its working-set number is set to 0. 
B Predecessor(e) will find max{e' e PU {— oo} | e' < e}, without changing any working-set 
numbers. 

H Successor(e) will find min{e' e P U {oo} | e < e'}, without changing any working-set 
numbers. 
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Table 1 The operation time and space overhead of important structures for the dictionary 
problem. Here e* is the predecessor or successor in the given context. In a search for an element e 
that is not present in the dictionary Is is n. 



H Insert(e) inserts e into the dictionary with at working-set number of 0, all other working-set 
numbers are increased by one. 

H Delete(e) deletes e from the dictionary, and does not change the working-set number of 
any element. 

There are numerous data structures and algorithms in the implicit model which range from 
binary heaps [16J to in-place 3-D convex hull algorithms [6J. There has been a continuous 
development of implicit dictionaries, the first milestone was the implicit AVL-tree [T2] having 
bounds of 0(log^n). The second milestone was the implicit B-tree [7] having bounds of 
O (log^ ri/ log log n) the third was the flat implicit tree [S] obtaining 0{logn) worst-case time 
for searching and amortized bounds for updates. The fourth milestone is the optimal implicit 
dictionary ?8 obtaining worst-case C'(logn) for search, update, predecessor and successor. 

Numerous no n- implicit dictionaries attain the working-set property; splay trees |15j . skip 
list variants [2], the working-set structure in [TT], and two structures presented in [5]. All 
achieve the property in the amortized, expected or worst-case sense. The unified access 
bound, which is achieved in [T], even combines the working-set property with finger search. 
In finger search we have a finger located on an element / and the search cost of finding say 
element e is a function of c?(/, e) which is the rank distance between elements / and e. The 
unified bound combines these two to obtain a bound of C'(minegp{log(4 + d{e, f) + 2)}). 
Table [T] gives an overview of previous results, and our contribution. 

The dictionary in |8j is, in addition to being implicit, also designed for the cache-oblivious 
model [ini, where all the operations imply 0{loggn) cache-misses. Here B is the cache-line 
length which is unknown to the algorithm. The cache-oblivious property also carries over 
into our dictionary. Our structure combines the two worlds of implicit dictionaries and 
dictionaries with the working-set property to obtain the first implicit dictionary with the 
working-set property supporting search, predecessor and successor queries in the working-set 
bound. The result of this paper is summarized in Theorem [T] 

► Theorem 1. There exists a cache-oblivious implicit dynamic dictionary with the working-set 
property that supports the operations insert and delete in time 0{logn) and ©(log^ n) cache- 
misses, search, predecessor and successor in time C'(logmin(i?p(g),€e,^5(e) 

)), 0(log V)) and 

0(log4(e)), and cache-misses ©(log^ min(£p(e), 4, 4(e))), C'(logs4(e)) and ©(log^ 4(e)), 
respectively, where p(e) and s(e) are the predecessor and successor of e, respectively. 
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Similarly to previous work [UIJ we partition the dictionary elements into O (log log n) blocks 
Bq, . . . , B„i, of double exponential increasing sizes, where Bq stores the most recently accessed 
elements. The structure in [i] supports predecessors and successors queries, but there is no 
way of knowing if an element is actually the predecessor or successor, without querying all 
blocks, which results in 0(logn) time bounds. We solve this problem by introducing the 
notion of intervals and particularly a dynamic implicit representation of these. We represent 
the whole interval [min(P); max(P)] by a set of disjoint intervals spread across the different 
blocks. Any point that intersects an interval in block Bi will lie in block Bi and have a 
working-set number of at least 2^ . This way when we search for the predecessor or successor 
of an element and hit an interval, then no more points can be contained in the interval in 
higher blocks, and we can avoid looking at these, which give working-set bounds for the 
search, predecessor and successor queries. 

fl Data structure 

We now describe our data structure and its invariants. We will use the moveable dictionary 
from [4 as a black box. The dictionary over a point set S is laid out in the memory addresses 
It supports the following operations in O(logn') time and O(log^n') cache- misses, 
where n' = j — i + 1: 

H Insert-left(e) inserts e into S which is now laid out in the addresses [i — 
B Insert-right(e) inserts e into S which is now laid out in the addresses + 1]. 
H Delete-left(e) deletes e from S which is now laid out in the addresses [i + 1; j]. 
H Delete-right(e) deletes e from S which is now laid out in the addresses — 1]. 
H Search (e) determines if e S S, if so the address of element e is returned. 
H Predecessor(e) returns the address of the element max{e' G S" | e' < e} or that no such 
element exists. 

H Successor(e) returns the address of the element min{e' e S" | e < e'} or that no such 
element exists. 

From these operations we notice that we can move the moveable dictionary, say left, by 
performing a delete-right operation for an arbitrary element and re-inserting the element 
again by an insert-left operation. Similarly we can also move the dictionary one position to 
the right. 

Our structure consists of m = O(loglogn) blocks Bq, . . . , Bm, each block Bi is of size 
0{2^ ), where A: is a constant. Elements in Bi have a working-set number of at least 
2^ . The block Bi consists of an array Di of Wi = d ■ 2*+'^ elements, where c? is a constant, 
and moveable dictionaries Ai,Ri,Wi,Hi, Ci and G^, for i = 0, . . . , m — 1, see Figure [T] For 
block Bjn we only have if |i?m\{min(P), max(P)}| < Wm, otherwise we have the same 
structures as for the other blocks. We use the block Di to encode the sizes of the movable 
dictionaries Ai, Ri,Wi, Hi,Ci and Gi so that we can locate them. Discussion of further 
details of the memory layout is postponed to Section [3] 

We call elements in the structures Di and Ai for arriving points, and when making a 
non-arriving point arriving, we will put it into Ai unless specified otherwise. We call elements 
in Ri for resting points, elements in Wi for waiting points, elements in Hi for helping points, 
elements in Ci for climbing points and elements in Gi for guarding points. 

Crucial to our data structure is the partitioning of [min(P); max(P)] into intervals. Each 
interval is assigned to a level and level i corresponds to block Bi. Consider an interval 
lying at level i. The endpoints ei and 62 will be guarding points stored at level 0, . . . , i. 
All points inside of this interval will lie in level i and cannot be guarding points, i.e. 
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Figure 1 Overview of how the working set dictionary is laid out in memory. The dictionary 
grows and shrinks to the right when elements are inserted and deleted. 
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Figure 2 The structure of the levels for a dictionary. The levels are indicated to the left. 



]ei; e2[n(ljj_^j Bj U Gi) = 0. We do not allow intervals defined by two consecutive guarding 
points to be empty, they must contain at least one non-guarding point. We also require 
min(P) and max(P) to be guarding points in Go at level 0, but they are special as they do 
not define intervals to their left and right, respectively. A query considers Bq, Bi, . . . until 
Bi where the query is found to be in a level i interval where the answer is guaranteed to 
have been found in blocks Bq, . . . , Bi. 

The basic idea of our construction is the following. When searching for an element it is 
moved to level 0. This can cause block overflows (see invariants l|5]-lj9]in Section 2.2 1, which 
are handled as follows. The arriving points in level i have just entered from level « — 1, and 
when there are 2^'^*^ of them in Ai they become resting. The resting points need to charge 
up their working-set number before they can begin their journey to level i + 1 . They are 
charged up when there have come 2^ further arriving points to level i, then the resting 
points become waiting points. Waiting points have a high enough working-set number to 
begin the journey to level i + but they need to wait for enough points to group up so that 
they can start the journey. When a waiting point is picked to start its journey to level i + 1 
it becomes a helping or climbing point, and every time enough helping points have grouped 
up, i.e. there is at least c = 5 consecutive of them, then they become climbing points and are 
ready to go to level i -I- 1. The climbing points will then incrementally be going to level i + I. 
See Figure |2] for an example of the structure of the intervals. 



2.1 Notation 

Before we introduce the invariants we need to deflne some notation. For a subset S' C P, we 
define Ps{e) — max{s & S U {— oo} | s < e} and Ss(e) = min{s G SU {oo} \ e < s}. When 
we write S<i we mean IJ^-^q Sj where Sj C P for j = 0, . . . ,i. 

For 5* C P, we define GIL5(e) = 5n]pp^5(e); e[ to be the Group of Immediate Left points 
of e in which does not have any other point of P\S in between them, see Figure [s] Similarly 
we define GIR5(e) = Sn]e; Sp\s{e)[ to the right of e. We will notice that we will never find all 
points of GILs(e) unless |GIL5(e)| < c, the same applies for GIRs(e). For 5 C P, we define 
FGL5(e) = S'n]pp^g(p5(e)); P5(e)] to be the First Group of points from S Left of e, i.e. the 
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group does not have any points of P\S in between its points, see Figure [3] Similarly we 
define FGRs(e) = 30 [ss{e);sp\s{^s{^))[- We will notice that we will never find all points of 
FGL5(e) unless |FGLs(e)| < c, the same apphes for FGR5(e). 

We will sometimes use the phrasings a group of points or e 's group of points. This refers 
to a group of points of the same type, i.e. arriving, resting, etc., and with no other types of 
points in between them. Later we will need to move elements around between the structures 
Di, Ai, Ri, Wi, Hi, Ci and Gi. For this we have the notation X ^ Y, meaning that we 
move h arbitrary points from X into Y , where X and Y can be one of Di, Ai, Ri, Wi, Hi, 
Ci and Gi for any i. 

When we describe the intervals we let ]a; b] be an interval from a to 6 that is open at a 
and closed at b. We let (a; b) be an interval from a to 6 that can be open or closed at a and 
b. We use this notation when we do not care if a and b are open or closed. In the methods 
updating the intervals we will sometimes branch depending on which type an interval is. For 
clarity we will explain how to determine this given the level i of the interval and its two 
endpoints ei and 62- The interval (ei; 62) is of type [ei; 62) if ei S Gi, else ei G G<i-i and 
the interval is of type ]ei; 62). This is symmetric for the other endpoint 62- 



2.2 Invariants 

We will now define the invariants which will help us define and prove correctness of our 
interface operations: insert(e), delete(e), search(e), predecessor(e) and successor(e). We 

maintain the following invariants which uniquely determine the interval^ 

1.1 A guarding point is part of the definition of at most two interval^ one to the left at 
level i and/or one to the right at level j, where i ^ j. The guarding point e lies at level 
min(i, j). The interval at level min(z, j) is closed at e, and the interval at level max(i,j) 
is open at e. We also require that min(P) and max(P) are guarding points stored in Go, 
but they do not define an interval to their left and right, respectively, and the intervals 
they help define are open in the end they define. A non-guarding point intersecting an 
interval at level i, lies in level i. Each interval contains at least one non-guarding point. 
The union of all intervals give ] min(P); max(P)[. 

1.2 Any climbing point, which lies in an interval with other non-climbing points, is part of a 
group of at least c points. In intervals of type [ei; 62] which only contain climbing points, 
we allow there to be less than c of them. 



We assume that |P| = n > 2 at all times if this is not the case we only store Go which contains a single 
element and we ignore all invariants. 
^ Only the smallest and largest guarding points will not participate in the definition of two intervals, all 
other guarding points will. 

Pp\s(Ps(ei)) Ps(ei) ei Pp\s(e2) 62 

i i i i i 

00 • • ••000 o»«« • o 



FGLs(ei) GILs(e2) 
Legend: • S o P\S ° P 

W Figure 3 Here is a illustration of FGL and GIL. Notice that GILs(ei) = whereas FGLs(ei) 7^ 
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1.3 Any helping point is part of a group of size at most c — 1. A helping point cannot have a 
climbing point as a predecessor or successor. An interval of type [ei; 62] cannot contain 
only helping points. 

We maintain the following invariants for the working- set numbers: 

1.4 Each arriving point in Di and Ai has a working set value of at least 2^' , arriving 
points in Dq and Aq have a working-set value of at least 0. Each resting point in Ri 
will have a working-set value of at least 2^ + \Ai\, resting points in Rq have a 
working-set value of at least \Aq\. Each waiting, helping or climbing point in Wi,Hi and 
Ci , respectively, will have a working-set value of at least 2^ . Each guarding point in 
Gi, who's left interval lies at level i and right interval lies at level j, has a working set 
value of at least 2^ * ^' ^ . 

We maintain the following invariants for the size of each block and their components: 

1.5 l-Dol = niin(|i?o| — 2, wq) and \Di\ = min(|i?i|, Wi) for i = 1, . . . , m. 

1.6 \R,\ < 22'+' and \W,\ + \H,\ + \Q\ \R,\ = 22'+' for i^O,...,m. 

1.7 \A,\ + \W,\ ^ 2^'^' for 1 = 0,..., m- 1, and \A,^\ + \W„,\ < 2^"'^". 

1.8 I Ail < 22'+' fori^O, ...,TO. 

1.9 \H^\ + \C,\ = 4c22'+' + where e [-c; c], for i = 0, . . . , m - 1. 
From the above invariants we have the following observation: 

0.1 From l{l]all points in Gi are endpoints of intervals in level i, and each interval has at 
most two endpoints. Hence for z = 0, . . . , m we have that 

|G,| < 2(|A| + \A\ + \R,\ + \W,\ + \H,\ + \C,\) < (4 -f 2d+8c) ■ 22'+' + 2c , 
where we in (*) we have used l(5] ijo] l(7]and ijo) 
From l|l]we have the following lemma. 

► Lemma 1. Let e be an element, ei = Pg< (g); 62 = SG<i(e) and i be the smallest integer 
for which /(ei, 62, i) —]ei; e2[n [Sj=o ^ ^- Then 1) (ei; 62) is an interval at level i if e is 
non-guarding and 2) (ei; e) or (e; 62) is an interval at level i if e is guarding. 



Proof. Assume that i is the minimal i that fulfills /(ei,e2,i) 7^ 0, where ei = Pg< (s) ^-^d 
62 — SG<.(e). We will have two cases depending on if e is guarding or not. 

Lets first handle case 2) where e is guarding and hence in the dictionary: Since e is in 
the dictionary and ei < e < 62 we have from the minimality of i that e lies in level i, and 
from I^e is then part of an interval lying in level i either to the left or to the right. Say e is 
part of an interval to the left i.e. the interval {e'i \ e). If ei < e'l then this would contradict 
that ei = PG<i(e) hence e'l < ei, but since e[ is the predecessor of e we have that e[ = ei. 
So we know that (ei; e) defines an interval at level i. The argument for (e; 62) is symmetric. 

In the case 1) e is non-guarding and e may lie in the dictionary or not: Since ei < e < 62 
we have from the minimality of i that e lies in level i, hence from ijljwe have that the interval 
(ei;e2) lies at level i. -4 
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2.3 Operations 

We will briefly give an overview of the helper operations and state their requirements (R) and 
guarantees (G), then we will describe the helper and interface operations in details. Search(e) 
uses the helper operations as follows: when a search for element e is performed then the 
level i where e lies is found using find, then e and 0(1) of its surrounding elements are moved 
into level by use of move-down while maintaining l[l]-l|4] Calls to fix for the levels we have 
altered will ensure that ijsj-ljs] will be maintained, finally a call to rebalance-below(i — 1) 
will ensure that l[9]is maintained by use of shift-up(j) which will take climbing points from 
level j and make them arriving in level j + 1 for j = 0,...,z — 1. Insert (e) uses find to find 
the level where e intersects, then it uses fix to ensure the size constraints and finally e is 
moved to level by use of search. 

H Find(e) - returns the level i of the interval that e intersects along with e's type and 

whatever e is in the dictionary or not. [R&G: /|7]-/j^ 
H Fix(i) - moves points around inside of Bi to ensure the size invariants for each type 

of point. Fix(i) might violate ijojfor level i. [R: /{Tj-jj^ and that there exist ci, . . . ,C6 

such that \Di\ + ci,\A,\ + C2, \Ri\ + C3, \Wi\ + C4, \Hi\ + C5, \Ci\ + cq fulfill /{^/{H where 

IqI = 0(1) /ori= 1,...,6. G:I\^I^. 
H Shift-down(i) - will move at least 1 and at most c points from level i into level i — 1. [R: 

and \H,\ + |Q| = 4c22*^' + c[, where < = 0(1). G: 
H Shift-up(i) - will move at least 1 and at most c points from level i into level i + 1. [R: 

and \H,\ + |C,| = 4022'+" + c'^, where c<c', = 0(1). G: 
B Move-down(e, i, J, fbefore, ^after) - If 6 is in the dictionary at level i it is moved from level 

i to level where i > j. The type tbcforc is the type of e before the move and taftcr is 

the type that e should have after the move, unless i = j in which case e will be made 

arriving in level j. [R&G: /|7]-/j^. 
H Rebalance-below(i) - If any c < q for I ~ 0, . . . ,i rebalance-below(i) will correct it so lj9] 

will be fulfilled again for 1 = 0,..., i. [R: and J^]^^ slack{ci) = 0(1), where 

if ci e [-c; c] , 
\ci \ — c otherwise . 



slack{ci) 



G: /|7]-/1|/. 

B Rebalance-above(i) - If any q < — c for / = i, . . . , m — 1 rebalance-above(z) will correct it 
so l|9] will be fulfilled again ior I = i, . . . ,m - 1. [R: and YZ^^ slack{ci) = 0(1) . 

G:I\^I\^. 

Find(e) We start at level i = 0. If e < niin(P) or max(P) < e we return false and 0. For 
each level we let ei = pQ^.(e), 62 = S(3^.(e), p = PBi\Gii'^) s = SB.\Q.{e). We find p and 
s by querying each of the structures Di, Ai, Ri, Wi, Hi and Ci, we find ei and 62 by querying 
Gi and comparing with the values of ei and 62 from level i — 1. While p < ei and 62 < s 
we continue to the next level, that is we increment i. Now outside the loop, if e G Bi we 
return i, the type of e and the boolean true as we found e, else we return i and false as we 
did not find e. See Figure^ for an example of the execution. 

Predecessor(e) (successor(e)) We start at level z = 0. If e < min(P) then return —00 
(min(P)). If max(P) < e then return max(P) (00). For each level we let ei = pQ^.(e), 
p = p^. (e), 62 = SG<.(e) and s = s^. (e). While p < ei and 62 < s we continue to the next 
level, that is we increment i. When the loop breaks we return max(ei,p) (min(s,e2)). See 
Figure |4] for an example of the execution. 
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I Figure 4 The last three iterations of the while-loop of find(e), predecessor(e) and successor(e) 



Insert(e) If e < min(P) we swap e and min(P), call fix(O), rebalance-below(m) and return. 
If max(P) < e we swap e and max(P), call fix(O), rebalance-below(m) and return. 

Let Q = GILci(e), Cr = GIRci(e), hi = GIL//, (e) and hr — GIR/f. (e). We find the level i 
of the interval (ei; 62) which e intersects using find(e). 

If e is already in the dictionary we give an error. If |c/| > or \cr\ > or (ei; 62) is of 
type [ei; 62] and does not contain non-climbing points then insert e as climbing at level i. 
Else if -I- 1 -I- > c then insert e as climbing at level i and make the points in hi and 
hr climbing at level i. Else insert e as helping at level i. Finally we call rebalance-below(m) 
and then search(e) to move e from the current level i down to level 0. 



Search(e) We first find e's current level i and its type t, by a call to find(e). If e is in the 
dictionary then we call move-down(e, i, 0, t, arriving) which will move e from level i down to 
level and make it arriving, while maintaining ljT]-l|8] but l|9] might be broken so we finally 
call rebalance-below(i — 1) to fix this. 



Fix(i) In the following we will be moving elements around between Di^ Ai, Ri, Wi, Hi and 
Ci. The moves Ai Ri and Ri — ?> Wi, i.e. between structures which are next to each other 
in the memory layout, are simply performed by deleting an element from the left structure 
and inserting it into the right structure. The moves Wi HiU Ci and the other way around 
HiU Ci ^ Wi will be explained below. 

If \Di\ > Wi then perform Di A Ai where h ~ \Di\ — Wi. If \Di\ < Wi and \Bi\{iam{P), 
max(P)}| > |A| then perform H, U C, W.,, Wi R^, Ri A, and A, H D, where 
hi = min(u;,-|A|, \Hi\ + \C^\), /la = min(u;,-|A|, \Wi\ + hi), h^ = min(w,-|A|, |i?»| + ^2) 
and h^ = min(i/7i — \Di\, \Ai\ + /13). 

If 114^,1 -I- ji/il + |Cj| 7^ and |Pi| < 2^'^" then perform H, U C, ^ Wi and W^ H P, where 
hi = min(22'"'' - |P,|, \Hi\ + \Ci\) and /i2 = min(22*+' - |Pi|, \W^\ + hi). If |P,| > 2^'^" then 
perform Ri — ^ Ai where hi = \Ri \ — 2'^ * . 

li i < m and \Ai\ + \Wi\ < 2^'^'° then perform HiUCi ^ Wi, where hi = min(2^'^'' - 
{\A,\ + \W,\), \H,\ + lai). If \A,\ + \W,\ > 22'+" then perform W^ ^ U C, where hi ^ 
mm{\A,\ + m-2^'^\m). 

If l^il > 2^ then let hi = \Ai \ ~ 2'^ , delete Wi as it is empty and rename Ri to Wi. 
Now move hi elements from Ai into a new moveable dictionary X, rename Ai to Ri, rename 
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X to Ai and perform Wi ^ HiU Ci. 

Performing Wi HiU d: Let w — s\Yi{—oo), ci = GILci(w), Cr = G\Rci{w), hi — GIL//.(u;) 
and hr = G\RHi{w). If |q| > or \cr\ > or (ei;e2) is of type [ei;e2] and only contains 
climbing points then make w climbing at level i. Else if + 1 + > c then make ft./, w 
and hr climbing at level i. Else make w helping at level i. 

Performing HiU Ci ^ Wi. Let w be the minimum element of S/^. (— oo) and Sp. (— oo), and 
let Cr = GIR(7;(?i;). Make w waiting at level i. If w was climbing and \c^\ < c then make 
helping at level i. 

Shift-down («) We move at least one element from level i into level i — 1, see Figure |4] 
If \Di\ < Wi then we let a be some element in _D,;. If |_D,;| < \Bi\ then: if \Ai\ = we 
perfornj^i?, U d ^ W,, H and -> A„ where hi = min(l,|i/i| + |C,|) and 
/i2 = inin(l, + hi), now we know that \Ai\ > so let a = Syi;(— oo), i.e., a is the leftmost 
arriving point in Ai at level i. We call move-down ( a, i, i — 1, arriving, climbing). 

Shift-up(i) Assume we are at level i, we want to move at least one and at most c arbitrary 
points from Bi into B^+i. Letj^ si — sci(— oo), ei = Pq^.(si) and 62 = sg<.(si), and let 

S2 = SCin[ei;e2](«l)> = SCiflleiiea] (S2) , S4 = Sc,n [ei ;e2] (ss) and S5 = Sc,n [ei ;e2] (■S4) , if they 

exist, also let Cr = GIRc. (54) be the group of climbing elements to the immediate right of 54, 
if they exist, see Figure [5] We will now move one or more climbing points from Bi into 
where they become arriving points. Ifz = m— lorz^m then we put arriving points into 
Di^i, which we might have to create, instead of Ai^i. 

Wc now deal with the case where (ei; 62) is of type [ei; 62] and only contains climbing 
points. Let I be the level of ei's left interval, and r the level of 62 's right interval, also let c/ 
be the number of climbing points in the interval. If Z = i + 1 we make ei arriving, else we 
make it guarding, at level i -f 1. Make the points of si, S2, S3 and 54 that exist arriving at 
level i + 1. If c/ < c then make S5 arriving at level i + 1 if it exists, also if r = i + 1 we make 
62 arriving, else we make it guarding, at level z -I- 1. Else make 55 guarding at level i. 

We now deal with the cases where (ei; 62) might contain non-climbing points. If p(si) = ei 
we make si and S2 waiting and guarding at level i, respectively, else we make si guarding at 
level i and S2 arriving at level i + 1. Now in both cases we make S3 arriving at level i + 1 
and S4 guarding at level i. If ((54; 62) is not of type [54; 62] or contains non-climbing points) 
and \cr\ < c, i.e. there are less than c consecutive climbing points to the right of S4, then we 
make the points Cr helping at level i. 

We have moved climbing points from Bi into and made them arriving. Finally we 

call fix(i-|- 1). 

Move-down(e, i, J, tbefore, ^after) Depending on the type iboforc of point e we have different 
cases, see Figure |5] 

Non-guarding Let ei = p^^^ (e), 62 = s^^ . (e) and let / be the level of the left interval of ei and 
r the level of the right interval of £2. Also let p2 = Ps.\G.n[ei;e2](Pi)i Pi = PsAGin[ei;e2](s)> 



^ The move HiVJ d \ Wi will be performed the same way as we did it in fix. 
* See the analysis in Section 4 for a proof that \Ci\ > 0. 
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P Figure 5 Here we see illustrations of how we maintain the intervals when updating the intervals. 
These only show single cases of each of the update methods many cases. 



si = s_B,\G,n[ei;e2](e) and S2 = SB^\Gin[ev,e2]isi)^ ^Iso let Q = FGLcin[ei;e2](e) be the elements 
in the first climbing group left of e, likewise let = FGR(7.n[gj.g2](e) be the elements in the 
first climbing group right of e. 

Case i — j'. make e arriving in level j, if |c;| < c then make the points in c; helping at 
level j, if \cr\ < c then make the points in Cr helping at level j. Finally call fix(j). 

Case i > j: If both p2 and pi exists we make pi guarding in level j and let e'l denote pi, 
else if only pi exists we make ei guarding at level min(^, j) and pi of type iaftcr at level j 
and let e'l denote ei, else we make ei guarding in level min(Z,j), and let e'l denote ei. If 
both si and S2 exists we make si guarding at level j, and let e'2 denote si, else if only si 
exists we make si of type taftor at level j and make 62 guarding at level min(j, r) and let 
e'2 denote 62, else we make 62 guarding at level min(j, r) and let e'2 denote 62- Lastly we 
make e of type taftor in level j. Now let cj denote the elements of c; which we have not 
moved in the previous steps, likewise let cj, denote the elements of Cr which we have not 
moved. If ((ei; e'^] is not of type [ei; e'l] or contains non-climbing points) and |cj| < c then 
make cj helping at level i. If ([62; 62) is not of type [e'2', 62] or contains non-climbing points) 
and |cj,| < c then make helping at level i. Call fi-x(i), fix(j), fix(min(/, i)) and fix(min(i, r)). 

Guarding If e = min(P) or e = max(P) we simply do nothing and return. Let ei — PG<fc(c) 
be the left endpoint of the left interval (ei; e[ lying at level h and 62 — SG<h (e) be the right 
endpoint of the right interval [e; 62) lying at level i, we assume w.l.o.g. that h > i, the case 
h < i is symmetric. Also let I be the level of the left interval of ei and r the level of the right 
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interval of e2. Let p2 = Pi3,AG„n[ei;e] (Pi) and pi = PB;,\Ghn[ei;el(e) be the two left points of 
e, if they exists, si — s^.^G-nfeiea] (g) and S2 — SB.\G.n[e;e2](si) the two right points of e, if 
they exits. Also let q = FGLc.n[ei;e] (e) and = FGRc.n[e;e2](c)- 

If P2 does not exist we make ei guarding at level niin(/,j), we make pi of type taftor at 
level j and let e'l denote ei, else we make pi guarding at level j and let e'l denote pi. If it 
is the case that i > j then we check: if S2 does not exist then we make si of type iaftor at 
level j, 62 guarding at level min(j, r) and let 62 denote 62, else we make si guarding at level 
j and let 62 denote si. We make e of type tafter at level j. 

Now let c'l be the points of c; which was not moved and cj, the points of which was 
not moved. If |cj| < c then make cj helping at level h. We now have two cases if e'2 exists: 
then if |c^| < c then make cj, helping at level i. The other case is if 63 does not exist: then if 
{{e'l, 62) is not of type [e[] 62] or contains non-climbing points) and |c^| < c then make cj, 
helping at level i. In all cases call fix(min(/, /i)), fix(/i) and fix(i). If i > j then call fix(j) 
and fix(min(j, r)). 

Delete(e) We first call find(e) to get the type of e and its level i, if e is not in the dictionary 
we just return. If e is in the dictionary we have two cases, depending on if e is guarding or not. 

Non-guarding Let c/ = GILci(e) be the elements in the climbing group immediately left of e, let 
Cr = GIRci(e) be the elements in the climbing group immediately right of e, let hi = GIL//, (e) 
be the elements in the helping group immediately left of e, and let hr — GIR//;(e) be the 
elements in the helping group immediately right of e. Let ei = PG^.(e) and let 62 = SG^.(e). 
Let I be the level of the interval left of ei and r the level of the interval right of 62- 

We have two cases, the first is |]ei; e2[n-Bi| = 1: ii I > r make ei guarding and 62 
arriving at level r, if Z < r then make ei arriving and 62 guarding at level I. li I — r and 
|P| — n > 4 then make ei and 62 arriving at level I — r. Delete e, call fix(r), fix(Z), fix(i) 
and rebalance-above (1). 

The other case is |]ei; e2[ni3i| > 1: If ((ei;e2) is not of type [ei;e2] or contains non- 
climbing points) and \ci \ + \cr \ < c then make q and Cr helping at level i. li \hi \ + \ hr\ > c 
then make hi and hr climbing at level i. Delete e, call fix(i) and rebalance-above(l). 

Min-guarding If e min(P) then let e' = SG^^^(e) and e" — SG^„^{e') where is the level 
of (e; e') and i is the level of (e';e"). The case of e = max(P) is symmetric. Also let 

Sl = SBo\Gon[e;e'](e), S2 = SBo\Gon [e;e'] (si ) , ^1 = SB,\G,n[e';e"] (e') and ^2 = SB,\G,n [e';e"] (^1 ) • 

If S2 exists then delete e make Si guarding at level and call fix(O). If S2 does not exist 
and t2 exists then delete e make Si and ti guarding and e' arriving at level and finally call 
fix(O) and fix(j). If S2 does not exist and ^2 does not exist then delete e, make Si and e" 
guarding and e' and ti arriving at level and finally call fix(O) and fix(i). In all the previous 
cases return. 

Guarding Let h be the level of the left interval (ei : e[, let i the level of the right interval 
[e : 62) that e participates in. We assume w.l.o.g. that h > i, the case h < iis symmetric. Let 
I the level of the left interval that ei participates in, where ei = pQ^^(e) and 62 = SG<fe(e). 
Let p2 = Ps,AGhn[ei;e](Pi) and pi = PijAG„n[ei;e] (e)- Let q = FGLcJe) be the points in the 
first group of climbing points left of e. 

If p2 exist we make pi guarding at level i, and let e' denote pi, else we make ei guarding at 
level min(Z, i), let e' denote ei and if [e'; 62) is of type [e'; 62] and contains only climbing points 
then we make pi climbing at level i else we make pi waiting at level i. Let cj be the points 
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in ci which was not moved in the previous movement of points. If |cj| < c make cj helping at 
level h. If e' is ei then call fix(Z). Delete e, call fix(/i), fix(i) and rebalance-above(l). 

Rebalance-below(i) For each level Z = 0, . . . , z we perform a shift-up(/) while c < q. 

Rebalance-above(i) For each level Z = z, . . . , to — 1 we perform shift-down(Z + 1) while 
ci < -c. 

3 Memory management 

We will now deal with the memory layout of the data structure. We will put the blocks in the 
order Bq, . . . , Bm , where block Bi further has its dictionaries in the order Di , Ai , Ri , Wi , Hi , Ci 
and Gi , see Figure [T] Block Bm grows and shrinks to the right when elements are inserted 
and deleted from the working set dictionary. 

The Di structure is not a moveable dictionary as the other structures in a block are, it 
is simply an array of Wi = d2^^^ elements which we use to encode the size of each of the 
structures Ai, Ri,Wi, Hi,Ci and Gi along with their own auxiliary data, as they are not 
implicit and need to remember 0(2'+'^) bits which we store here. As each of the moveable 
dictionaries in Bi have size 0{2^ ) we need to encode numbers of 0(2*+*^) bits in Di. 

We now describe the memory management concerning the movement, insertion and 
deletion of elements from the working-set dictionary. First notice that the methods find, 
predecessor and successor do not change the working-set dictionary, and layout in memory. 
Also the methods shift-down, search, rebalance-below and rebalance-above only calls other 
methods, hence their memory management is handled by the methods they call. The only 
methods where actual memory management comes into play are in insert, shift-up, fix, move- 
down and delete. We will now describe two methods internal-movement - which handles 
movement inside a single block/level ~ and external-movement - which handles movement 
across different blocks/levels. Together these two methods handle all memory management. 

lnternal-movement(TOi, . . . , mi) Internal-movement in level i takes a list of internal moves 

nil, ■ ■ ■ T^ni to be performed on block Bi, where I — 0(1) and move mj consists of: 

B the index 7 = Di, Ai, Ri,Wi, Hi,Ci,Gi of the dictionary to change, where we assum^ 

that TOj.7 < rrih-j, for j < h, 
B the set of elements to put into 7, where |S'in| — 0{1), 
H the set of elements ^out to take out of 7, where |5'out| = C(l) and 
H the total size difference <5 = |S'in| — |S'out| of 7 after the move. 

For j = I, . . . ,1 do: ii nij.S < then remove ^out from 7, insert Sin into 7 and move 
7 -|- 1, . . . , G left |TOj.(5| positions, where we move them in the order 7 -I- 1, . . . , G. If rrij.S > 
then move 7 -t- 1, . . . , G right rrij.S positions, where we move them in the order G, . . . , 7 -I- 1, 
remove S'out from 7 and insert Sm into 7. See Figure [6] 

It takes C(log(22'+')) = 0{2'+'') time and 0(logB(22"'')) = O(g^) cache-misses to 
perform move j. In total all the moves mi, . . . ,mi use 0(2*+'^) time and 0{ ^^ g ) cache-misses, 
as I = 0(1). 



^ We will misuse notation and let 7 -|- 1 denote the next in the total order D, A, R, W, H, C, G. We will 
also compare mj.7 and 771^.7 with < in this order. 
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Internal-movement(mi. . . . ,mi) External-movement(A/i, . . . , A//) 
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Figure 6 (Left) Memory movement of internal-movement inside of a block Bi. (Right) Memory 
movement of external-movement across multiple blocks Bmi.-j, ■ ■ ■ , Bmi.-^- 



External-movement(Mi , . . . , M;) External-movement takes a list of external moves Mi , . . . , 
Ml, where / = 0(1). Move Mj consists of: 

H the index < 7 < m of the block/level to perform the internal moves mi, . . . , ruq on, 

where Mj.7 < Mh-^ for j < h, 
H the list of internal moves mi, ... , m^ to perform on block 7, where q — 0(1), and 
H the total size difference A — X]^=i "^;i.<5 of block 7 after all the internal moves mi, . . . , m^ 

have been performed. 

Let A = Y^[^^ Mi.A be the total size change of the dictionary after the external- moves 
have been performed. If A = then we let 7ond — Mi.j else we let 7ond — m. Let pond — 
S J=o l-Bj I + A be the last address of the right most block that we need to alter. Let si,. . . ,Sk 
be the sublist of the indexes {!,...,/} where Mg^.A < for i = 1, . . . , k. Let ai, . . . ,ah be 
the sublist of the indexes {1, . . . , ^} where Ma^ .A > for i — 1, . . . ,h. 

We first perform all the internal moves of each of the external moves M^^ , . . . , M^^. . Then 
we compact all the blocks with index i where Mi. 7 < i < 7cnd so the rightmost block ends 
at position Pond- Finally for each external move Ma^ for i = I, . . . ,h: move Bm^,. .7 left so it 
aligns with Bm^.. -y-i and perform all the internal moves of Mq. , then compact the blocks 
Bmc-.j+I: ■ ■ ■ j^-A/a^i.7-1 at the left end so they align with block BMa .-y- 

It takes O (/log (22*+')) = O [12'+^) time and O (/log^ (22'+')) = O (/^) cache- 
misses to perform the internal moves on level i. In total all the external moves Mi , . . . , Mi use 
Q{2'tcnd+k^ time and O ( ^"[""''g ^ cache- misses, as the external move at level 7ond dominates 
the rest and / = C(l). 

3.1 Memory management in updates of intervals 

With the above two methods we can perform the memory management when updating the 
intervals in Section [2.3| Whenever an element moves around, is deleted or inserted, it is 
simply put in one or two internal moves. All internal moves in a single block/level are 
grouped into one external move. Since all updates of intervals only move around a constant 
number of elements, the requirements for internal/external- movement that I = 0(1) and 
q = 0(1) are fulfilled. From the above time and cache bounds for the memory management 
the bounds in Theorem [T] follows. 

^ Analysis 

We will leave it for the reader to check that the pre-conditions for each methods in Section 
12.31 are fulfilled and that the methods maintains all invariants. We will instead concentrate 
on using the invariants to prove correctness of the find, predecessor, successor and shift-up 
operations along with proving time and cache-miss bounds for these. We will leave the time 
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and cache-miss bounds of search, rebalance- above, rebalance-below, shift-down, insert, delete 
and fix for the reader as they are all similarly in nature. 

Find(e) We only consider the cases where min(P) < e < max(P), the other cases trivially 
gives the correct answer in 0(1) time and cache-misses as min(P), max(P) e Gq. Assume 
that find(e) stops at level i, then we have that ei < p or s < 62 so /(ei, 62, i) 7^ and i is the 
minimal i where this happens, see lemma|lj Notice that ei = PG<i(6) and 62 = SG<.(e), so ei 
and 62 are the same as in lemma [T] When the while loop breaks we have all the preconditions 
for lemma [1] Now e is either in the dictionary, or not, and if e is in the dictionary it is either 
guarding or not, so we have three cases. 

Case 1) e is in the dictionary and is non-guarding: then we have from lemme [ij^that 
(ei; 62) is a interval at level i and e S Bi. From this we also have that log(^e) > log(2^ ). 

Case 2) e is not in the dictionary: from lemma [l](ei; 62) lie at level i and we know that e 
intersects it. Since e is not in the dictionary £e = n and then log(£e) > log(2^ ). 

Case 3) e is in the dictionary and is guarding: from lemma [l] we have that either 
(ei; e) or (e; 62) lie in level i, hence e G Gi C Bi. From this we also have that log(^e) > 
log(2^-'-^'"^"^)>log(22-"^"^). 

From the above we see that find(e) runs in 0(log(2^ )) = C'(logmin(£p(e),fe,4(e))) 
time. When we look at the cache-misses we will first notice that the first [loglogiJj levels 
will fit in a single cache-line because all levels are next to each other in the memory layout, 
so the total cache-misses will be 




= C'(logBmin(^p(e),4,4(e)))- 



Predecessor(e) (and successor(e)) We will only handle the predecessor operation, the case 
for the successor is symmetric. Since we have the same condition in the while loop as for find, 
we know that when it breaks it implies that /(ei, 62, i) 7^ 0. So from lemma [l] e intersects a 
interval at level i and the predecessor of e is now max(ei,p). 

From ijijwe know that log(^p) > log(2^ ^ ) and the total time usage is J2]=o ^O^S.i'^^ ^ )) 
~ 0(2*+*^) = C'(log(^p)). Like in find, the first [loglogPj levels fit into one block/cache-line 
hence the total cache-misses will be 0{logg{£p)). 

Shift-up(i) For shift-up to work for level i it is mandatory that \Ci\ > so that sp. (— cx)) 
will return a element which can be moved to level i + 1. From the precondition that 
\H,\ + \C,\ 4c22'*'° + c^, where c < = we have that 

\C,\ = 4c22*+' + c', - > 4c22'^' - c - \U,\ 

so proving that \Ri\ < 4c2^'^'' — c is enough. From l|3]we can at most have c — 1 helping 
points in a helping group, so for every c — 1 helping points we need a separating point, the 
role of the separating point can be played by a point from Di, Ai,Ri, Wi or G<i-i. These 
are the only ways to contribute points to Hi hence for i > 1 we have this bound 

m < (c-l)(|A| + |A,| + |i?,| + |Ty,| + |G<,_i|) 

< (c - 1) + 2 • 22'+' + ^ ((4 + 2d + 8c)22'+' + 2c) j 

(c - 1) • 2'+'' + 2 • 22'*' -f (4 -I- 2d -I- 8c) • 2 • 2^^^"'' + 2ci^ 
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Where we in (*) have used iji] l|6]lj7]and o[l] and in (**) have used that 2^' = 2^' ' • 2^' 
and 2^ > I for I > 1. If we use that c = 5 then for k > foglog(380 + 20d) + 1 we have that 
IQI > 4c22'+' - c - > for i = 1, . . . , m - 1. 

For z = we have a different bound as G'<i_i is empty, we get the bound 

\Ho\ < (c-I)(|A| + |A,| + |i?,| + |W,|) 
< (c-1) (d-2'+'= + 2-22'+') 

but for k > foglog(380 + 20d) + 1 this is of course still sufficient as |-ffo| only got smaller. So 
we have proved that \Ci\ > for level i = 0, . . . , m — 1. 

Move-down(e, ibefore, ^after) Move-down moves a constant number of points around and 
into level j from i. If e is non-guarding we call fix(i), fix(j'), fix(min(Z, i)) and fix(min(i, r)). 
If e is guarding we call fix(min(Z, h)), fix(/i) and fix(i), and if i > j we also call fix(j) and 
fix(min(j, r)). In the non-guarding case the time is bounded by C'(log2^ ) = ©(log^e) and 
the cache-miss bounds are dominated by 0{logg 2^ ) — 0(log^ £e)- In the guarding case 
the time is bounded by C(log2^ ) = ©(log^e) and the cache-miss bounds are dominated 
byO(logB22'^'=) = 0(logB4). 

1 5 Further work 

We still have some open problems. Is it possible to change the insert operation such that when 
we insert a new point it will get a working-set value of n + 1 instead of 0? We can actually 
achieve this in our structure by loosening the invariant on the working-set number of guarding 
points to only require that they have a working-set number of at least 2^ ' '^'^ , but then 
for search the time will increase to ©(log min(£e, max(£p(g'), 4(g)))) and the cache-misses to 
0(10 

gg min(4, max(^p(g'), 4(e)))) and the bounds for predecessor and successor queries would 
increase to 0(logmax(i'p(g-), 4(e))) time and 0(log^ max(^p(g'), 4(e))) cache-misses. 

Another interesting question is if we can have a dynamic dictionary supporting efficient 
finger searches [S] in the implicit model, i.e., we have a finger / located at a element and 
then we want to find an element e in time ©(log d(/, e)), where d(/, e) is the rank distance 
between / and e. But very recently [Tl] have shown that finger search in 0{logd{e, f)) 
time is not possible in the implicit model. They give a lower bound of Q(logn). Now we 
could instead separate the finger search and the update of the finger, say we allow the finger 
search to use 0{q{d(e, /))) time for some function q. In this setting they also prove a lower 
of fl{q~^{logn)) for the update finger operation, where q~^ is the inverse function of q. 
They also give almost tight upper bounds for this setting, in the form of a trade-off bound 
between the finger search and the update finger operations. The finger search operation 
uses 0(log d{e, /)) -I- q{d{e, /)) time, and the update finger operation uses 0{q~^ {log n) log n) 
time. But even given their result it still remains an open problem whatever dynamic finger 
search with an externally maintained finger is possible in C'(logd(e, /)) time. So in other 
words is it possible to do finger search in ©(log d{e, /)) time if we allow the data structure 
to store ©(logn) bits of data that can store the finger? 
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